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Abstract — Small-world networks are networks in which the 
graphical diameter of the network is as small as the diameter 
of random graphs but whose nodes are highly clustered when 
compared with the ones in a random graph. Examples of 
small-world networks abound in sociology, biology, neuroscience 
and physics as well as in human-made networks. This paper 
analyzes the average delivery time of messages in dense small- 
world networks constructed on a plane. Iterative equations 
for the average message delivery time in these networks are 
provided for the situation in which nodes employ a simple greedy 
geographic routing algorithm. It is shown that two network 
nodes communicate with each other only through their short- 
range contacts, and that the average message delivery time rises 
linearly if the separation between them is small. On the other 
hand, if their separation increases, the average message delivery 
time rapidly saturates to a constant value and stays almost the 
same for all large values of their separation. 

I. Introduction 

Many networks arising in science [1], technology [2] and 
society [3] exhibit complex connections by means of small 
numbers of long-range links, and are surprisingly closely 
connected despite their extent. Such networks have become 
known as small-world networks. The most striking character- 
istic of small-world networks is their small graphical diameter 
even though they are highly clustered. One important field 
of science in which small-world network structures frequently 
arise is the theory of social networks where the connections 
among people in a social network are studied [4]. Social 
networks will be the major focus of this paper; however, the 
results obtained for the average delivery time of messages are 
also valid in similar small-world network models appearing in 
other fields of science and technology. 

Papers such as [5] and [6] are experimental verifications 
of the small-world phenomenon in social networks. Watts and 
Strogatz proposed an elegant small-world network model in [7] 
which enables mathematical analysis of small-world networks. 
Algorithmic perspectives are investigated in [9], [10] and [11]. 
In this work, we also focus on the algorithmic perspectives 
of small-world networks, and analyze the average message 
delivery time when nodes run a greedy geographic forwarding 
algorithm to deliver a message to the final destination. We 
obtain an exact recursive expression for the average message 
delivery time in a small-world network rather than providing 
bounds as in [10] and [11]. 

The research was supported by the U.S. National Science Foundation under 
Grants ANI-03-38807 and CNS-06-25637. 



II. Network Model 

To investigate the average message delivery time in small- 
world networks, we consider an i? by i? square as our network 
domain. Let V represent this network domain, and d{x, y) 
be distance between any two points x and y belonging to 
v. We distribute n relay nodes randomly (from a uniform 
distribution) over the network domain. Locations of source s 
and target t nodes will be assumed to be arbitrary so that 
we may analyze the average delivery time of a message as 
a function of their separation. We let Xg be the location of 
the source node, Xt be the location of the target node and 
{Xi}^^i be locations of n relay nodes. 

For each n > 1, we define 7i„ = {Xi}f^^. The random 
set Hn will be called the node-location process. We define 
the continuum limit as Hoc — U^i^n- ^i^ important 
topological property regarding Hoc is the following. 

Theorem 1: With probability one (w.p.l), Tioo is a dense 
subset of the network domain. 

Proof: Let (ri,5,P) be the underlying probability space 
on which all of the random variables are defined. Let Q 
be the set of rational numbers, and Q^ = Q x Q. Let 
r = PflQ^- Choose any a; e T. For any small ball 
B \x,-] C V around x with radius - and p being an 
integer greater than or equal to ifM let Z„ = T^^=i ^j' ^^'^ 
^i = ^ix^B(x iH- Since, under our model, the ^^'s are 
independent and identically distributed (i.i.d) random variables 
with finite means /i > 0, we have — — > /i w.p.l. Observe 

that Zoo - \Hoo(^B(x,^^\ > Win^B(x,l)\ = Z„ for 
all n > 1. Thus, Z^o > lim„^oo ^n = oo. As a result, there 
will be infinitely many points of Tioo lying in B (x, -) w.p.l 



Let flx,p C 51 be the set on which Tioo has infinitely many 
points in B (x,^). We define flo as fin = HxeT fl^i ^x,p- 
P{ilo} = 1 since it is an intersection of countably many sets 
having probability measure one. Take an a; e Hq. Now, for 
any x G V and e > 0, we can find z G T and p> I such that 
B (z, A C B{x,e). Therefore, noo{i^)f]B{x,e) = oo. This 
completes the proof ■ 

A. Connectivity Properties of Nodes 

We assume that each node in the network maintains both 
local and long-range contacts as in other existing small-world 
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network models. We first describe how the set of local contacts 
of a node is formed. 

(i) Local Contacts of a Node: All nodes in the network 
have a uniform communication range r. Nodes lying inside 
this communication range of a node form its local contacts. 
Let us consider a particular node i with location Xi. Then, its 
local-neighborhood is the ball B{Xi,r). 

(ii) Long-range Contacts of a Node: In addition to its 
local contacts, each node maintains some number of long- 
range links for communication. We assume that there are two 
types of long-range links: (i) incoming long-range links and 
(ii) outgoing long-range links. A node can receive a message 
through an incoming long-range link but cannot send any data 
over this link. On the other hand, a node can send a message 
over an outgoing long-range link but cannot receive anything. 
Such a differentiation between incoming and outgoing long- 
range links helps in simplifying some of the mathematical 
details. 

B. Limiting Process for Forming the Outgoing Long-range 
Contacts 

We now put forward the rules for the limiting process for 
forming outgoing long-range contacts as the number of relay 
nodes goes to infinity. 

1) Rules for fondling outgoing long-range contacts: Source 
and target nodes are only allowed to choose their outgo- 
ing long-range contacts among relay nodes. Relay nodes 
are only allowed to choose their outgoing long-range 
contacts among relay nodes. A node chooses the node 
at the receiving end of an outgoing long-range link at 
random uniformly among all other nodes which do not 
lie in its local neighborhood. After a node chooses a 
long-range contact, it is not allowed to change it as new 
relay nodes are added to the network. 

2) Initialization and the limiting process: We initialize the 
number of relay nodes to m where m is an arbitrary 
integer greater than 1, and uniformly distribute m relay 
nodes over T). We place source and target nodes at arbi- 
trary positions in V. Nodes then choose their outgoing 
long-range contacts as explained above. We send the 
number of relay nodes to infinity by adding new relay 
nodes. As a new relay node is added, nodes which have 
not been able to pick an outgoing long-range contact 
yet are given a chance to form a long-range outgoing 
contact with this new relay node. Nodes which have 
akeady chosen their outgoing long-range contacts are 
not allowed to change them. 

Let Eni be the event that node i e {s,t}lj{l,2, • • • , n} 
has an outgoing long-range contact when there are n relay 
nodes located in V. On this event, node i's outgoing long- 
range contact will be uniformly distributed over 'D — B{Xi, r). 
Therefore, the probability that node i has an outgoing long- 
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Fig. 1. A typical realization of the network. 



We have the following theorem when there are infinitely 
many relay nodes located in V. 

Theorem 2: For a given node i E {s, t} IJ{1, 2, 3, • • • } and 
a given subset A of T), 



F{3j in Af^n^ s.t I "^ j\X,} = 



h^ 



A'^{zeA-B{X,,r)} 



dz 



Proof Let En^i be defined as above, and A„_i — 

LRC 

{3j in Ai^Ti-n S.t i — > j}. {S„,J„>„i is an increasing 
sequence of events since nodes are not allowed to change 
their outgoing long-range contacts once they choose them. 
Therefore, whenever a node has an outgoing long-range 
contact for some n > to, it also has an outgoing long- 
range contact for all k > n. Similarly, {A„,i}„>m is an 
increasing sequence of events. Let i?oo,i = Un>m ^",i ^'^'^ 
Aoo.i = Un>m -^ri.i- Sincc Tioo IS a dense subset of V, we 
have P(i?oo.i) — linin^oo P(-B„,i) — \. Observe also that 
Aoc.i = {3j in ^n^oo s.t i — > j}. Then, by using the 
continuity of measure from below [12] and Bayes theorem, 
we have 



P(Ao 



lim P(A„.,|^„,,)P(£;„.^) 

n — )-oo 

It, ^{zev-B{Xi,r)}<iz 



{^eA-B(Xi.r)}^ 



er>-B(Xj,r)} 



dz' 



A typical realization of the network, and the probability 
with which nodes form their outgoing long-range contacts are 
depicted in Fig. ?? and Fig. |2] respectively. 

C. S-Greedy Geographic Forwarding Rule 

Nodes use a simple (5-greedy geographic forwarding (S- 
GGF) rule to deliver their messages to their intended target 
nodes. In S-GGF, the only global information needed to deliver 
a message is the location of its final destination, which can be 
encoded inside the message by the message originator 

According to this rule, any message holder having distance 
d E [kr, {k + l)r) to Xt for some fc > 1 chooses one of 
its local contacts lying in B{Xt, kr) and providing a forward 
progress at least r — S in the direction of the target node for 
some (5 > as a next hop local contact. After a node chooses 




Existence probability 
of such a link is 

X4l{gEA-B(j:.,r)}dz 
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Fig. 2. An illustration of the probability with which a node chooses its 
outgoing long-range contacts. 



its next hop local contact, it is not allowed to change it as 
new relay nodes are added to the network even if there are 
some other local contacts providing forward progress larger 
than the chosen one. This rule can be interpreted as that nodes 
have bounded greediness while choosing their next hop local 
contacts. 

If the current message holder has an outgoing long-range 
contact providing forward progress in the direction of the 
target node larger than the progress provided by the local 
contact chosen as described above, this long-range contact is 
preferred over the local contact. 

III. Message Trajectory and Average Message 
Delivery Time 

A. Message Trajectory 

In accordance with the search algorithm, a message wanders 
around the network hop by hop until it reaches the target. 
When there are n relay nodes, we denote the resulting random 
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trajectory of the message by M^. ' , where A/^, ' is the location 
of the message at the fcth step. AIq is always equal to Xs- We 
define the message delivery time t„ under 7i„ to be equal to 
the first time the message reaches the target node. Therefore, 
T„ can be written as t„ — inf{fc > : Af^." = Xt}. Note 
that T„ = oo if the forwarding rule cannot find a path from 
the source node to the target node. r„ also depends on the 
locations of source and target nodes since the message starts 
its journey at Xg and finishes it at Xf. However, we will not 
explicitly write it as a function of Xg and Xt by keeping this 
fact in mind. 

Definition 1: We say a forwarding rule is loop-free if it does 
not form any loop while delivering the message to the target 
node. 

Definition 2: We say a forwarding rule is convergent if for 
any given Xg and Xt, r„ converges to a limiting real- valued 
random variable w.p. 1 as n goes to infinity under the rules in 



II-B for forming long-range contacts. 

For example, the shortest path routing is a convergent 
forwarding rule since the message delivery time monotonically 
decreases as new relay nodes added to the network domain. 
For convergent forwarding rules, we let lim„^oo t„ = t be 
the message delivery time under Tioo- 

Let !F denote the collection of all convergent and loop-free 
forwarding rules. We have the following theorem establishing 



the relationship between E[t„] and E[t] for this type of rules. 

Theorem 3: Fix Xg and Xt- Then, for any forwarding rule 
in J^ with P{r„ > B} = o{n^'^) for some finite B > 0, 

lim E[r„a{,„<^}]=E[r]. (1) 

n — *oo 

Proof: Since the forwarding rule is loop-free, we have 

'''nl{r„<oo} = Tn^lT^KB} + '''nl{B<r„<n+l} • 

Since r„ converges to r w.p.l, we have P{r > B} = 0. 
Therefore, by using bounded convergence theorem, we have 
lim„^oo IE[t„1{t-^^<5j.] = E[t]. By using the condition on the 
decay rate of the tail of the distribution of r„, we have 

< lim E[Tnl{B<r„<n+i}] < lim {n + l)P{r„ > B} 

= 0. 

As a result, lim„^oo E[T„a{^^<oo}] = E[t]. ■ 

We now show that Theorem |3] holds for (5-GGF. To this end, 
we need to show a series of lemmas. 

Lemma 1: (5-GGF is a convergent forwarding rule for any 
d>0. 

Proof We prove this lemma by induction. Consider the 
event Hq on which Hoq is a dense subset of V, and let d — 
d{Xg,Xt). T„ = if d = 0, and r„ = 1 if < d < r. 
Therefore, it is convergent for < d < r. Take any uj ^ Hq. 
Since 7ioo('jj) is a dense subset of V, the message eventually 
enters B{Xt,r), and therefore t„(w) converges to 2 for any 
uj E Hq and for all d G [r,2r). Similarly, it can be shown 
that r„ either converges 2 or 3 on Hq for all d E [2r, 3r). 
Now, assume that t„ converges on ilo for all k < K and 
kr < d < {k -\- l)r. We will show that it is also correct 
for any given d G [{K + l)r, {K + 2)r). Take any uj e fio- 
Note that B{Xg,r) f] B{Xt, {K + l)r) has positive area. Since 
Tiaoi^) is ^ dense subset of I?, M{^'{uj) will eventually lie in 
B{Xt, {K-{-l)r) through either a short-range contact or a long- 
range contact for all n large enough. Since the next hop short- 
range and long-range contacts are fixed once they are chosen, 
the relay node holding the message after the first transmission 
will be the same for all n large enough. Now, this relay node 
located at M{' (uj) and currently holding the message can 
be interpreted as the new source node of the message. By 
induction hypothesis, the message delivery time starting at this 
relay node converges. Therefore, t„ also converges on ilo for 
allde[{K + l)r,{K + 2)r). ■ 

/i(fc) _ 1 

/2(fe) ^ -^■ 

-> cx). Then, 



We write fi{k) ~ f2{k) as fc ^ oo if limfe_ 
Lemma 2: Let Cfc — > and b^. 



oo as fc 



(l+ak) " ^ exp(afc6fc) if and only if (ak) bk — > as fc — > oo. 
Lemma 3: For any given source-target separation d = 
d{Xg, Xt) and 6 > 0, there exists a constant B > such that 
T„ under (5-GGF satisfies P{t„ > B} < exp{—c-f{n)), where 
c > is a constant independent of d and 6, and f{n) ^ v?'^ 
for some e e (O, |). 

Proof Choose an e belonging to (O, 4). Divide T> into 



small sub-squares with side length e„ 



Let h{n) 



Thus, 



n2 



(2) 



712 ^ be the number of small sub-squares. Index sub- 
squares by j G {1,2, ••• ,h{Ti)}. If Sj represents the jth 
sub-square, we have V = Uiii '5'j- Index relay nodes by 

iG {1,2,--- ,n}. Then, P{X, G 5j} 

P{X,^Sj,ie{l,2,--- ,n}} = 
By using Lemma [2] we have 

P{X, iS,,ie {1,2,- ■■ ,n}} ~ exp 

< exp (-0.25 • n'^") . 

Let E„ ^ {lu e n : 3j e {1,2,- •• ,h{n)} s.t. X,{uj) ^ 
Sj,yi G {1, 2, • • • , n}}. By using the union bound, we have 



(3) 



¥{E„) < h{n) 1 



exp(-c./(n)). 



where c = 0.25 and /(n) = n^^ -41og(2) -4(1 -2e) log(n). 

On the complement E'^-^ of the event £"„, the message 

can make forward progress at least r — 5 amount towards 

destination at each hop for all n large enough. Let B = 



^ + 1. Then, E[^ C {t„ < B}. Thus, 



>B} 



< 
< 



1 - P«) 
exp(-c- f{n)). 



Theorem 4: If nodes use the (5-GGF rule to relay mes- 
sages, then r„ converges to a limiting random variable r and 

lim„^ooE [t„]1{^„<oo}] = E [t]. 

Theorem |4] allows us to relate the average message delivery 
time for dense networks with finitely many nodes to the 
average message delivery time in the continuum limit for the 
(5-GGF rule. Therefore, we will focus on the continuum limit 
and the (5-GGF rule for the rest of the paper. 

B. Average Message Delivery Time 

We now calculate the average message delivery time in 
small-world networks in the continuum limit when relay nodes 
are uniformly distributed over T). We let d = d{Xs,Xt) be the 
distance between source and target nodes. In order to eliminate 
possible edge effects that can occur, we place the target node 
at a position Xt satisfying B{Xt, d+r) C T). This assumption, 
in turn, necessitates R be greater than 2- (d + r). One can also 
extend the analysis presented below to small-world networks 
constructed on the surface of a sphere and on the surface of 
a torus where we do not run into edge-effect problem. Due 
to space limitations, we focus here only on the rectangular 
network domains. 

Consider the 5-GGF rule for some S > 0, and concentric 
cylinders C{Xt,kr, {k + l)r) = B{Xt, {k + 1)^) - B{Xt,kr), 
fc > 0, centered at Xt with thickness r. In the continuum limit, 
it is not hard to see that the progress of a message toward the 
destination node at each hop is at least r — 6 units, and the 
message enters to an inner concentric cylinder from an outer 
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Fig. 3. Exemplaiy trajectories of the message when 2r < d(Xs,Xt) < 3r. 



one due to Hoc being dense in T). As a result, if nodes do not 
have any long-range contacts, it can be shown that r satisfies 
the following theorem. 

Theorem 5: Under (5-GGF with no long-range contacts, r, 
with probability one, is equal to 





d(X,,Xt) 



(4) 



if X, = Xu 
+ 1 ifd{Xs,Xt)>Q. 

Let us now turn our attention to the more interesting case 
in which nodes maintain long-range contacts in addition to 
those with their local neighbors. Consider any two source 
nodes located on the boundary dB{Xt,d) of B{Xt,d). Due 
to the symmetry of the problem, neither of the nodes is better 
than the other in delivering the message to the destination. 
Therefore, the expected value of r is only a function of the 
separation between the source-target pair, and there is no 
ambiguity in the following definition. 

Definition 3: g{d) = E''[t] is the expected delivery time of 
a packet to the final destination Xt starting from the source 
node Xg when d{Xs, Xt) = d. 

Let us first calculate g{d) for several ranges of d before 
going into a more general solution. g(0) = since there is no 
need to transmit the message. For < d < r, g{d) ~ 1 since 
source and target nodes can directly communicate with each 
other For r < d < 2r, g{d) — 2 since the source can reach the 
target in exactly two hops due to Hoc being dense in V. For 
2r < d < 3r, g{d) is a little tricker to obtain. The key is to 
analyze where the message can be located after the first hop. It 
can be delivered to Xt in two hops if it makes its way through 
B{Xt,r) at the first hop. Otherwise, it first enters to the disc 
C{Xt,r, 2r), and then it is delivered to the target in two hops 
thereafter As a result, the second path takes 3 hops to deliver 
the message to the final destination. Exemplary trajectories of 
the packet are shown in Fig. [3] 

These observations lead to g{d) = 1 + P{0 < d{Mi,Xt) < 
r}+ 2-¥{r < d{Mi,Xt) < 2r}, where Mk, fc > 0, represents 
the location of the message at the fcth hop under Hoc- By 
letting a == P{0 < d{Mi,Xt) < r} = j^i^, we obtain 
g{d) = 1 + ct + 2(1 - a) for 2r < d < 3r. 

General calculations for any d lying in [k-r, (fc + 1) -r), k > 
2, are also in this spirit. We first look at where the message can 
be located at the first hop, then analyze the expected delivery 
time from this point on. This analysis leads to a recursive 



solution for g{d) for any value of d. To this end, we let g{d) = 
go when d = 0, g{d) = gi when < d < r, g{d) — gk when 
{k-l)-r <d< fc-r, 2 <k< L#-1J and 5(d) 
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when \^ - l\r < d < ^ -r. 
Consider 5fc+i when 2 < fc < [|^ 
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IJ . Then, 



9k+l 



1 + E 



z=l 



5j ■ ^{{i-l)-r<d(Mi,Xt)<i-r} 



fc-1 



= l + (l-a(fc-l)2)g, + a .^(21-1)3,. 

1=1 
Note that in the first equality above, there is no loss of 
generality in writing the limits of the indicator function as < 
d{Mi,Xt) < r when i = 1 since ¥{d{Mi,Xt) = 0} = 0. To 
obtain a second order non-constant coefficient linear recursive 
equation, we subtract g^ from ^fc+i. 

9k+i - gk = {gk - fffc-i) ■ (l - a{k - 1)^) . (5) 

Let Uk = gk+i — gk and f3k = l—a{k—l)^. Then, Uk satisfies 
the following first order linear recursive equation. 

Uk = h -Uk-i for /c > 1, (6) 

with the initial condition uq ~ 1. Observe that Uk = Y[i=i Pi 
for fc > 1. Then, the solution for (J5]) is obtained as 

k j 

gfc+i = l + ^[]/3,;forfc>l. (7) 

The following theorem summarizes these findings. 

Theorem 6: Consider a small-world network constructed on 
V, and containing nodes that have local communication range 
r and one uniformly distributed long-range outgoing contact 
on v. If nodes employ the (5-GGF rule to relay messages, then 
the average message delivery time, for 1 < fc < [^ ~ IJ. is 
given by gk+i = 1 + Ej=i DLi A, where gk = g{d) = 
E'*[r] when (fc — l)r < d < kr, g{d) = g\M-i\+i when 

[i-l\r < d < 1-r, a = ^^^, and go = and 
5i = 1 are the initial conditions. 

In Fig |4]-a and Fig. |4]-b , we plotted the change of average 
message delivery time with respect to the separation between 
source and destination nodes for two different network sizes. 
In both figures, the horizontal axis is normalized with respect 
to the local communication range r of nodes. Fig. p] reveals 
that the average message delivery time increases linearly for 
small values of separation between a source and destination 
pair. On the other hand, it quickly converges to a constant, and 
remains essentially the same at this constant value for a broad 
range of values of source-destination separations. This means 
that messages are first forwarded by means of long-range 
contacts until they enter a certain range of the destination. 
From this point on, they are delivered to the destination 
through local contacts. This quantifies the observation of 
Travers and Milgram [5]: "Chains which converge on the 
target principally by using geographic information reach his 
hometown or surrounding areas readily, but once there often 
circulate before entering targets circle of acquaintances." . 
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Fig. 4. Change of average message delivery time as a function of normalized 
source-destination separation, a, _R = 102 -r.h, R = 502 ■ r. In both figures, 
the horizontal axis is normalized with respect to r. 



IV. Conclusions 

Small-world networks arise in many disciplines of science 
including biology, neurology, sociology and computer science. 
In this work, we have focused on the average delivery time of 
messages to a final destination in dense small-world networks 
when nodes use a local geographic forwarding rule to relay 
messages. Existing work on small-world networks only pro- 
vides bounds on this average message delivery time. On the 
other hand, in this paper, we have presented a technique based 
on the first-step analysis for calculating an exact formula of 
the packet delivery time in a small-world network constructed 
on a plane. 
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